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Abstract 

We consider the problem of controlling the inventory of a single item 
with stochastic demand over a single period. Most of the research on 
single-period inventory models has focused on the case in which demand 
distribution parameters are known. Nevertheless, it is clear that the ap- 
plicability of these models directly depends on the accuracy of demand 
parameters estimation. In this work, we introduce a novel strategy to 
address the issue of demand estimation in single-period inventory opti- 
mization problems. Our strategy is based on the theory of statistical 
estimation. We assume that the decision maker is given a set of past 
demand samples and we employ confidence interval analysis in order to 
identify a range of candidate order quantities that, with prescribed con- 
fidence probability, includes the real optimal order quantity for the un- 
derling stochastic demand process with unknown parameter. In addition, 
for each candidate order quantity that is identified, our approach can pro- 
duce an upper and a lower bound for the associated cost. We apply our 
novel approach to three demand distribution in the exponential family: 
Binomial, Poisson, and Exponential. For two of these distributions we 
also discuss the extension to the case of unobserved lost sales. Numerical 
examples are presented in which we show how our approach complements 
existing strategies based on maximum likelihood estimators or on Bayesian 
analysis. 

keywords: inventory control; newsvendor problem; sampling; confidence 
interval analysis; demand estimation. 
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1 Introduction 



We consider the problem of controlling the inventory of a single item with 
stochastic de mand over a single p eriod. This problem is known as the "Newsven- 
dor" problem Silver et al.l Il998l | , since it is the prototype of the problem faced 
by a newsvendor trying to decide how many newspaper to stock on his stand 
before observing demand. The newsvendor faces costs if he orders too much 
or if he orders too little. This problem therefore consists in deciding the size 
of a single order that must be placed before observing demand when there are 
overage and underage c osts. A thorou gh literature review on the Newsvendor 
problem is presented by iKhouial |2onnj . Most of the research on single-period 
inventory models has focused on the case in which demand distribution param- 
eters are known. Nevertheless, it is clear that the applicability of these models 
directly depends on the accuracy of demand parameters estimation. 



The import ance of parameter estimation has b een pointed ou t in several 



works [see e.g. iFisher et al.l , Il994 iBerk et al.l . l2007l | . According to I Berk et al 



2007| . there are two main approaches to demand estimation in the inventory 



theory literature: the frequentist and the Bayesian approach. In the frequen- 
tist approach [see e.g. Nahmiasl . 1994L Agrawal and Smith . Il99fij a parametric 
demand distribution is empirically selected and point estimates (e.g. maximum 
likelihood or moment estimators) for the unknown parameters are obtained ac- 
cording to the observed data. In the Bayesian approach [see e.g. lHiiil . ll997L[l999l| 
a "prior" distribution is selected for the demand parameter, based on collateral 
data and/or subjective assessment. From the prior distribution a "posterior" 
distribution is derived as new demand data become available. The posterior 
distribution of the parameter is then used to construct, first, the posterior dis- 
tribution of the demand , and then to derive the optimal order quantity and 
objective function value. iHilll 1997| points out that the literature on the appli- 
cation of Bayesian methods to demand estimation in the context of inventory 
modeling is considerable in volume. Furthermore, most of the above mentioned 
works focus on the critical issue of unobserved lost sales and on the ability to 
incorporate in the estimation procedures suitable adjustments to accoun t for 
the u nobserved component of the d emand [Nahmiaa. .1994i. Agrawal and Smithl . 
1996| . Nevertheless, as discussed by Kevor kl |2010j . little research exists, to date, 
on the effect of demand estimation on optimal inventory policies for single-period 
inventory models. 



In this work, we introduce a novel strategy to address the issue of demand 
estimation in single-period inventory optimization proble ms. Our strate gy is 
based on the theory of statistical estimation introduced by iNeymanI 1937l | and 
it complements existing strategies based on maximum likelihood estimators or 
on Bayesian analysis. Consider a possibly very limited set of past demand ob- 
servations. Classical strategies based on maximum likelihood estimators or on 
Bayesian analysis would analyze these data and provide a single most-promising 
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order quantity an d an e stima ted cost associated with it; the latter being either 
a point estima te iHilll. 19971 or a confidence interval obtained via a normal 
approximation Kevork , 2010l |. These strategies are typically justified on the 
basis of some asymptotic convergence properties they feature. In contrast, our 
approach employs exact confidence intervals in order to identify a range of can- 
didate order quantities that, with prescribed confidence probability, includes the 
real optimal order quantity for the underlying stochastic demand process with 
unknown parameter(s). In addition, for each candidate order quantity that is 
identified, our approach computes an upper and a lower bound for the associ- 
ated cost. This range covers, according to the prescribed confidence probability, 
the actual cost the decision maker will incur if he selects that particular quan- 
tity. Our approach is therefore particularly appealing in all those situations in 
which the decision maker aims to assess, with suitable confidence level and from 
a (possibly limited) set of available data, the risk of exceeding a certain cost 
when a given candidate quantity is selected. Once the set of candidate optimal 
order quantities has been identified, it may comprise one or more elements. If it 
comprises more than a single value, expert assessment or any existing appr oach 



H, I1997L ll999llKevorMl2010( may 



based on classical estimation strategies [i.e. 
be employed to select the most promising of these values. However, we stress the 
fact that, in contrast to other approaches in the literature, our approach does 
not simply provide a point estimation, it provides instead complete informa- 
tion to the decision maker about the set of potentially optimal order quantities 
— according to the available data and to the chosen confidence level — and 
about the confidence interval for the expected cost associated with each of these 
quantities. Furthermore, in some cases it can be easily extended to account for 
unobserved lost sales. 



This work is structured as follows. In Section[2]we introduce the Newsvendor 
problem and its extension under sampled demand information. In Sections |3l 
[31 and[5]we discuss, respectively, the case of Binomial, Poisson and Exponential 
random demand with unknown distribution parameter. In Section [5] we discuss 
drawbacks of existing approaches in the literature, limitations of our research 
and future research directions. In Section [7] we draw conclusions. 
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2 The Newsvendor problem 



In this section, we shall summarize the key features of t he Newsvendor problem. 
For more details, the reader may refer to ISilver et alj |1998i] . Consider a one- 



period random demand d with mean /i and variance cr^. Let h be the unit 
overage cost, paid for each item left in stock after demand is realized, and let 
p be the unit underage cost, paid for each unit of unmet demand; we assume 
p,h> 0. Let g{x) = hx^ + px~ , where x^ = max(a;,0) and x^ = — min(x, 0). 
The expected total cost can be written as G{Q) = E[g{Q — d)], where E[-] 
denotes the expected value. 

It is a well-known fact that, for a continuous demand d, 

F{Q)^PT{d<Q} = -^^^l3. (1) 

If F is continuous there is at least one Q satisfying Eq. [U that is 

Q* = inf{Q > : F{Q) ^ /?}. 
For F strictly increasing, there is a unique optimal solution given by 

Q*^F-\13). (2) 



In practice, the probability distribution of the random demand d often has 
finite support over the set Nq = {0, 1, 2, . . . }. In this case it is useful to work 
with the forward difference AG{Q) = G{Q + 1) - G{Q), Q € Nq. It is easy to 
see that 

AG(Q) = h - {h+p)FT{d> j} 

is non-decreasing in Q, and that linig^oo AG{Q) = /i > 0, so an optimal solu- 
tion is given by Q* = minjQ € No : AG{Q) > 0} or equivalently 

Q* = min{Q e No : F{Q) > /?}. (3) 



2.1 Heuristics for order quantity selection 

Let us now consider the situation in which the decision maker knows the type 
of the random demand distribution (e.g. Binomial), but in which he does not 
know the actual values of some or all the parameters of such a distribution. 
Nonetheless, the decision maker is given a set of M past realizations of the 
demand. From these realizations he has to infer the optimal order quantity 
and, possibly, he has to estimate the cost associated with the optimal Q* he has 
selected. 
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2.1.1 Maximum likelihood approach 



A commonly adopted heuristic strategy for order quantity selection under sam- 
pled demand information consists in computing, from the available sample set, 
a point estimate for the unknown demand distribution parameter (s). T his may 
be done by using th e maximum likelihood estimator Le Cam . 199Cl| or the 



method of moments Newev and McFadden . 1986l |. For instance, assume that 



the available sample set comprises M observed past demand data, di, . . . , dA/, 
and that the demand is assumed to follow a Binomial distribution. The Bino- 
mial distribution comprises two parameters: the number of trials N and the 
success probability q. In the context of the Newsvendor problem, TV may be 
interpreted as the non-variant number of customers that enter the shop in a 
given day i £ {1, • ■ • , Af}, and q may be interpreted as the probability that a 
customer makes a purchase. Then, by assuming that N is known, the maximum 
likelihood estimator for parameter q is q = di/{MN). After computing q, 

the decision maker employs the random variable Bin{N] q) in place of the actual 
unknown demand distribution in Eq. [3] to compute the estimated optimal order 
quantity Q* and expected total cost G{Q*). 



2.1.2 Bayesian approach 



A Bayesian approach t o the Newsvendor problem under partial information is 
presented by iHilll |l997| . Hill's approach considers a "prior" distribution, based 
on collateral data and/or subjective assessment, for the unknown parameter(s) 
of the demand distribution. As new data are observed, the prior distribution is 
updated and a "posterior" distribution is generated. Hill then uses the posterior 
distribution for the unknown parameter for estimating the posterior distribution 
of the demand. Finally, the posterior distribution of the demand is used to 
estimate the value of the decision variable which optimizes the Newsvendor cost 
function. 

Other heuristic strategies for selecting a candidate order quantity can be 
adopted. For the sake of brevity, in this work we will focus on the two strategies 
discussed above. 

3 Binomial demand 



Consider a discrete random variable that follows a Bernoulli distribution. Ac- 
cordingly, such a variable may produce only two possible outcomes, i.e. "yes" 
and "no" , with probability q and 1 ~ q, respectively. This class of random vari- 
ables is particularly useful in representing so called "Bernoulli trials", which 
are experiments that can have one of two possible outcomes. These events can 
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be phrased into "yes or no" questions, such as "did the customer purchase the 
newspaper?" . 

Consider the foUowing situation: a Newsvendor has a pool of N customers 
that come every day to the stand. Each customer may buy or not a newspaper 
with probabihty q. It is a well-known fact that any experiment comprising a 
sequence of N Bernoulli trials, each having the same "yes" (resp. "no") proba- 
bility q (resp. 1 — q), can be re presented by a random variable Bin{N;q) that 
follows a Binomial distribution [Jeffrevj . 1961 1 with probability mass function 



Pi{Bin{N;q) = k} = (^)q''{l - q)^''' 



where k — 0, . . . , N. 



According to our previous discussion it is fairly easy to find the optimal 
order quantity Q* for a given random demand Bin{N; q). We shall now give 
a running example. Consider a random demand _Bm(50,0.5). Let h = 1 and 
p = 3, therefore /3 = 0.75. From Eq. [3] we compute Q* = 27 and the respective 
expected total cost G{Q*) = 4.4946. 

Let us now consider the situation in which the parameter q is not known. 
The decision maker is given a set of M past realizations of the demand. From 
these realizations he has to infer the optimal order quantity and, possibly, he 
has to estimate the cost associated with the optimal Q* he has selected. 

Since we operate under partial information it may not be possible to uniquely 
determine "the" optimal order quantity and the exact cost associated with it. 
Therefore, we argue that a possible approach consists in determining a range 
of "candidate" optimal order quantities and upper and lower bounds for the 
cost associated with these quantities. This range will contain the real optimum 
according to a prescribed confidence probability. 



3.1 Confidence intervals for the Binomial distribution 



Confidence interval analysis Nevmanl . 1937 . 1941 [ is a well established technique 
in statistics for computing, from a given set of experimental results, a range of 
values that, with a certain confidence level (or confidence probability), will cover 
the actual value of a parameter that i s being estimated. In the year s several 
techniques [Clopper and Pearson! 11934 . iGarwoodl . [l936l iTrivedi [2OOIL etc.] for 
building confidence intervals for a given sample set have been proposed. 



Approximate techniques for building confidence intervals [see e.g. lAgresti and Coull 

1998| become relevant because, especially with small sample sizes, an exact con- 
fidence interval may be unnecessarily conservative. In this work, we focus on 
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the exact interval and we leave the analysis of the benefits brought by the use 
of approximate intervals as future research. 



A method that is commonly classified as the "e xact confidence intervals" fo r 
the Binomial distribution has been introduced bv lClopper and Pearson 1934 1. 



This method uses the Binomial cumulative distribution function in order to 
build the interval from the data observed. The Clopper-Pearson interval can be 
written as {qib,qub), where 

qib - mm{q\ Pr{Bin{N; q) > X} > {I - a)/2}, 
qub = max{g| Pr{Bin{N; q) < X} > {1 - a)/2}, 

and X is the number of successes (or "yes" events) observed in the sample and 
a is the confidence probability. Note that we assu me gib = when X = 
and that qub = N when X = N. As discussed by Agresti and CouU (l998| . 



this interval can be also expressed using quantiles from the Beta distribution. 
More specifically, the lower endpoint is the (1 — a)/2-quantile of a Beta dis- 
tribution with shape parameters X and N ~ X + 1, and the upper endpoint 
is the (1 + a)/2-quantile of a Beta distribution with shape parameters X + 1 
and N ~ X. Furthermore, the Beta distribution is, in turn, related to the 
F-distrib ution so a third form ulation of the Clopper-Pearson interval, also dis- 
cussed by Agresti and Coulll 1998.] , uses quantiles from the F distribution. 



It is fairly intuitive to see that the "quality" of a given confidence interval 
is directly related to its size. The smaller the interval, the better the estimate. 
In general, confidence intervals that have symmetric tails (i.e. with associated 
probability (1 — a)/2) are not the smallest possible ones. A large literature 
exists on the topic of determining the small est possible inte rvals for a given 
parameter/distribution combination [see e.g. Zielinski . 2010j |. The discussion 



that follows is somehow independent of the particular interval adopted. For the 
sake of simplicity, we will adopt intervals having symmetric tails. 



3.2 Solution method employing statistical estimation based 
on classical theory of probability 

We shall now employ Clopper-Pearson interval for computing an upper and a 
lower bound for the optimal order quantity Q* in a Newsvendor problem under 
partial information. The confidence interval for the unknown parameter q of 
the Binomial demand Bin{N\q) is simply {qib,qub) where 

qih ^ mm{q\Pi{Bin{MN;q) > X} > {1 - a)/2}, 
qub = max{q\Pr{Bin{MN;q) < X} > (1 -a)/2}, 

and X ~ '^iLi di- Let Q'^j^ be the optimal order quantity for the Newsvendor 
problem under a Bin(N, qib) demand and Q* ^ be the optimal order quantity for 
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the Newsvendor problem under a Bin{N,qub) demand. Since AG{Q) is non- 
decreasing in Q, according to the available information with confidence proba- 
bility a the optimal order quantity Q* is a member of the set {(5*6, • • • , Qub}- 

We shall now compute upper (c„(,) and lower (cib) bounds for the cost 
associated with a solution that sets the order quantity to a value in the set 
{Qib, ■ . ■ , Qub}- Let us write the cost associated with an order quantity Q, 

Q N 

G{Q) = hJ2 Pr{Bin{N; q) = i}{Q - i) + pj^ Pr{Bin{N; q) = i}{i-Q). 

1=0 i=Q 

Then, consider the function 

Q N 

Ggiq) = hY^ MBin{N- q) =i}{Q- i) +p ^ Pr{Sm(7V; q) = i}{i- Q), (4) 

i=a i=Q 

in which the order quantity Q is fixed and in which we vary the "success" 
probability q G (0, 1). It can be proved that GQ{q) is convex in the continuous 
parameter q; this is trivially true when Q = N . The proof for < Q < is 
given in the Appendix I. 

Although it is possible to prove that Gq(?) is convex in q, there is no closed 
form expression for finding the q* that minimizes this function. Nevertheless, 
due to its convexity in q, it is clearly possible to use convex optimization ap- 
proaches to find the q* that minimizes or maximizes this function over a given 
interval. 

Let us consider the confidence interval {qib,qub) for the parameter q of the 
Binomial demand. For a given order quantity Q, consider the value 

9Q,min = arg min Ggiq) (9Q,max = arg max GQ{q)] 

q&(gib,qub) \ qe{qib,qub) J 

that minimizes (maximizes) Gq^q) for q G {qib,qub)- With confidence proba- 
bility a, Gq((7qj^;^) and GQ{qQ ^^^) represent a lower and an upper bound, 
respectively, for the cost associated with Q. 

By recalling that the optimal order quantity Q* is, with confidence probabil- 
ity a, a member of the set {Qij,, ■ ■ ■ , Qub}^ easy to compute upper (c*;,) and 
lower (c*j) bounds for the cost that a manager will face, with confidence proba- 
bility a, whatever order quantity he chooses in the candidate set {Q*;,, . . . , <3^(,}- 
The lower bound is 



the upper bound is 
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It should be emphasized that, when the confidence interval {qw, qub) cov- 
ers the real parameter q of the binomial demand we are estimating, then the 
set {Q^f,, . . . , Qui,} comprises the optimal order quantity Q* and the interval 
(c*^, c*^) comprises the real cost associated with every possible order quantity 
in {Q^f,, . . . , Q*ub\- Given the way confidence interval (qib, qub) is constructed, it 
is guaranteed that this happens with probability a. 

Of course, by increasing the number M of past observations, we can de- 
crease the size of confidence interval {qib,qub)- As a direct consequence, also 
the cardinality of the set {Q^^, . . . , Q^hl decreases. In the ideal case, this set 
comprises a single candidate order quantity Q* that with confidence probability 
a represents an optimal solution to the problem and has a cost comprised in 
the interval (Gq- ((?Q.,min)> Gq- ((Jq-^^^x))- 

Finally, consider the case in which unobserved lost sales occurred and the M 
observed past demand data, di, . . . , cZm, only reflect the number of customers 
that purchased an item when the inventory was positive. The analysis discussed 
above can still be applied provided that the confidence interval for the unknown 
parameter q of the Bin(N] q) demand is computed as 

qib = min{g| Pr{i?*n(E^li Nf,q) > X} > (1 - a)/2}, 
qub - max{q| Pr{Bzn(E^ii N,;q) < X} > {1 - a)/2}, 

where Nj is the total number of customers that entered the shop in day j — 
for which a demand sample dj is available — while the inventory was positive. 



3.3 Algorithm 



The procedure to compute, under the prescribed confidence probability a, a 
candidate set Q of order quantities and upper (c*^) and lower (c*j,) bounds for 
the cost a manager faces when he selects one of these quantities is presented 
in Algorithm [T] The code initially computes Clopper-Pearson interval {qib,qub) 
by exploitin g the relationship b etween the Binomial distribution and the Beta 
distribution Evans et al. . 200Cl| |. Then it computes the critical fractile /? and 



the upper and lower bound for the set Q of candidate order quantities. Finally, 
it iterates through the elements of this set to compute the upper (c*;,) and 
lower bound (cj*^) for the estimated cost associated with these candidate order 
quantities. 

In general, the set Q = . . . , Q^i,} may comprise a significant number of 
elements, especially if a very limited amount of samples is available. A decision 
maker may then employ one of the heuristic strategies discussed in Section 12.11 
in order to determine the most promising quantity in this set. 
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Algorithm 1: Newsvendor under incomplete information. Binomial de- 
mand^ 

input : M past demand realizations d;, i = 1, . . . , M; 
the number of customer per day: N\ 
the holding cost: h\ 
the penalty cost: p; 
the confidence probability: a. 
output: the set Q of candidate order quantities; 

the interval (c;*j,c*^) for the estimated cost. 

begin 

Ei=i,....M'^«; 
6^Af7v-E.=i,...,Mrf^; 

qub <— InverseCDF [BetaDistribution(a + 1, &) , (1 + q;)/2] ; 
qih -S— InverseCDF [BetaDistributionCa, 6+ 1) , (1 — q;)/2] ; 
fi^plip + h); 

Q*^ InverseCDF [BinomialDistribution(A^, q^b) ,/?] ; 
Q;*f, ^InverseCDF [BinomialDistributionCA^, q/^) ; 

^ oo; 
for eac/i Q G Q do 

9Q,max ^ argmax^ef,,^,,^^) GQ(q); 
9Q,min ^ argmin^g(^,^_,^^) GQ(q); 

L ^ min(c;*ft,GQ(g^ „ji„)); 



3.4 Example 

We consider a simple example involving the Newsvendor problem under Bino- 
mial demand. Assume that, in our problem, h — 1, p = 3, and the demand 
follows a Bin{bO,q) distribution, in which parameter q is unknown. We are 
given 10 samples for the demand, which we may use to determine the optimal 
order quantity Q*. The samples are {28,28,24,27,25,26,28,28,23,27}. The 
real value for parameter q, which is used to generate the 10 samples is 0.5. 
Accordingly, the optimal order quantity Q* is equal to 27 and provides a cost 
equal to 4.4946. 

We consider a — 0.9. By using Algorithm [T] we compute the set of candidate 
order quantities Q — {27, 28, . . . , 31} and the confidence interval for the esti- 
mated cost (c^f,,c*j) = (4.4268,7.2205). Among the candidate quantities in Q, 
both the heuristics presented in Section 12.11 identify Q* — 29 as the candidate 
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optimal quantity. By using the approach discussed in Section [321 '^e compute 
the a confidence interval for the estimated cost, which is 

{GQ^('ih',^J^^Q'^'^h',n.j) - (4.4487,4.9528). 



Clearly, the information on the minimum and maximum cost associated with 
each order quantity in Q lets the decision maker perform a more educated choice. 
For instance, if a manager is not a risk-taker, he may decide select the order 
quantity Q*, for which the a confidence interval for the estimated cost has the 
lowest possible upper bound G^,(q% ). In the above example, this is stih 
29, but in general it may be a different order quantity. 



Less conservative, but approxi mate, confidence intervals ma y be obtained 
by replacing the Clopper-Pearson Clopper and Pearsonl Il934| interval with 
the Agresti-Coull Agresti and CouU . 1998l | interval for the Binomial parame- 
ter. The maximum likelihood and the Bayesian heuristics do not employ con- 
fidence intervals for selecting the candidate order quantity. Therefore they are 
not affected by this choice and the selected order quantity remains Q* = 29. 
The a confidence interval for the estimated cost associated with Q* = 29 is 



G. 



Q* ^^3*, mill)' 



('^a^j)- (4-4487,4.9155). 



Q* ,max 

than that produced by using the Clopper-Pearson interval 



This interval is 7.3% smaller 



4 Poisson demand 



In many practical contexts, a random demand distributed according to a Pois- 
son law may become relevant. A random demand Poisson(X) is said to be 
distributed according to a Poisson law with rate parameter A > 0, if its proba- 
bility mass function is 

Pr{d= /c} = e"^ — , fc = 0,l,2,...,cx3. 
k\ 

The Poisson distribution is the limiting distribution of the Binomial distribu- 
tion when N is large and q is small. In this case, the parameters of the two 
distributions are linked by the relationship A = qN. We recall that the expected 
value of d is A and that the standard deviation of d is \/A. 

By using Eq. [3l we easily obtain the optimal order quantity Q for a given 
demand d. We shall give an example. Consider a demand d that follows a 
Poisson{50) distribution. Let h = 1 and p — 3, therefore (3 — 0.75. The 
optimal order quantity is Q* ~ 55. Furthermore, by noting that 

oo 

G{Q) = h{Q - A) + {h -t- p) ^ (1 - PY{Poisson{X) < i}), 

i=Q 
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the optimal cost is G{Q*) = 9.1222. 



We shall now consider, also in this case, the situation in which the parameter 
A is not known. Instead, the decision maker is given a set of M past realizations 
of d. As in the previous case, from these realizations he has to infer the range 
of "candidate" optimal order quantities and upper and lower bounds for the 
cost associated with these quantities. This range will contain the real optimum 
according to a prescribed confidence probability. 



4.1 Confidence intervals for the Poisson distribution 



As in the previous case, we discuss the exact confidence interval that can be used 
to estimate the rate par ameter A of the Poisson distribution. This confidence 
interval was proposed bv lGarwood 1936| and takes the following form. Consider 
a set of M samples di drawn from a random demand d that is distributed 
according to a Poisson law wit h unknown parameter A. We rewrite d = X^i^o 
According to GarwoodI |l936 |. the confidence interval for A is (A/b, A„f,), where 

Xib = min{A| PT{Poisson{MX) > d} > (1 - a)/2}, 
A„b = max{A| PT{Poisson{MX) < J} > (1 - a)/2}. 

Th is interval can b e expressed in terms of the chi-square distribution, as shown 
by [Garwood 1936l |. Let Xn denote the chi-square distribution with n degrees 
of freedom, and G'~^(x^, •) denote the inverse cumulative distribution function 
of Xn- We can write 

G-HxL-.(l-«)/2) 



Ajb — 



2M 



a)/2) 



2M 



Furthermore, it is p ossible to ex press this interval using quantiles from the 
Gamma distribution Swift . 2009| . More specifically, the lower endpoint is the 
(1 — Q!)/2-quantile of a Gamma distribution with shape parameter d and scale 
parameter 1/Af, and the upper endpoint is the (1 + a)/2-quantile of a Gamma 
distribution with shape parameter c? -f 1 and scale parameter 1/M . Swift lists 
a number of existing approaches for building approximate intervals that are 
less conservative than Garwood's one and he also suggests strategies to shorten 
Garwood's interval by choosing suitable asymmetric tails Swiftl . 



4.2 Solution method employing statistical estimation based 
on classical theory of probability 

The method for computing an upper and a lower bound for the optimal order 
quantity Q* in a Newsvendor problem under Poisson demand and partial infor- 
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mation on parameter A can be carried out in a similar fashion to the Binomial 
case given in Section [3^ Consider Garwood's confidence interval (Ajb, A„b) for 
the unknown parameter A of the Poisson demand. Let Q'^i^ be the optimal order 
quantity for the Newsvendor problem under a Poisson(Xib) demand and (5*j, be 
the optimal order quantity for the Newsvendor problem under a Poisson(Xub) 
demand. With confidence probability a the optimal order quantity Q* is a 
member of the set {Qf^, . . . , (5*J. 

Consider the cost associated with an order quantity Q, 

Q oo 

G{Q) = /i^Pr{Poissor7,(A) = i}(g - i) + p ^ Pr{Poisson(A) = i}{i~Q). 
Also in this case we can prove that Gq{X) 

Q oo 

Gq(A) = h^Pr{Poisson{X) = i}{Q - i) + p ^ Pr{Poissori(A) = - Q), 

1=0 i=Q 

(5) 

is convex in A. The proof is given in the Appendix II. Therefore upper (c„b) 
and lower (cjb) bounds for the cost associated with a solution that sets the order 
quantity to a value in the set {Q*^,, . . . , QJif,} can be easily obtained by using 
convex optimization approaches to find the A* that minimizes or maximizes this 
function over a given interval. 

Also in this case, consider the case in which unobserved lost sales occurred 
and the M observed past demand data, c?i , . . . , dm , only reflect the number of 
customers that purchased an item when the inventory was positive. The analysis 
discussed above can still be applied provided that the confidence interval for the 
unknown parameter A of the Poisson{X) demand is computed as 

Xib = min{A| VT:{Poisson{MX) > J} > (1 - a)/2}, 
Xub = max{A| Pr{Poisson(MA) < 4 > (1 - a)/2}. 

where M — X^jli ^i' ^^^d Tj £ (0, 1) denotes the fraction of time in day j — 
for which a demand sample dj is available — during which the inventory was 
positive. 



4.3 Algorithm 

The computational procedure for Poisson demand is presented in Algorithm [5J 
The code initially computes Garwood's interval (Aib,A„b) by exploiting the re- 



lation ship between the Poisson distribution and the Gamma distribution [Swift 



|2009j. Then it computes the critical fractile j3 and the upper and lower bound 
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Algorithm 2: Newsvendor under incomplete information. Poisson de- 
mand^ 

input : M past demand realizations di, i ~ 1, . . . , M; 
the holding cost: h; 
the penalty cost: p; 
the confidence probability: a. 
output: the set Q of candidate order quantities; 

the interval (c;*t,c*j) for the estimated cost. 

begin 

^ X]i=l,...,M '^i' 
b ^ M] 

Xub -^InverseCDF [GammaDistribution(a + 1, 1/6) , (1 + a)/2] ; 
Xih 4— InverseCDF [GcunmaDistributionCa, 1/b) ,(1 — q:)/2] ; 
I3^p/{p + h); 

Q*^ -s— InverseCDF [PoissonDistribution(Aub) ,/?]; 
(5j*jj -r— InverseCDF [PoissonDistribution(A/(,) ,/?]; 

c;^ 4- oo; 

for each Q E Q do 

•^Q,max ^ arginax^g(^,^^^^^) G'q(A); 
^Q,mm ^ argminAe(A,,,A„,) Gq(A); 
<6 ^ max(c;f„GQ(A^ 



for the set Q of candidate order quantities. Finally, it iterates through the el- 
ements of this set to compute the upper (c*f,) and lower bound (c;*{,) for the 
estimated cost associated with these candidate order quantities. 



4.4 Example 

We consider a simple example involving the Newsvendor problem under Poisson 
demand. In our problem, h — 1, p ^ 3, and the demand follows a Poisson(X) 
distribution, in which parameter A is unknown. We are given 10 samples for the 
demand, which we may use to determine the optimal order quantity Q*, these 
are {51,54,50,45,52,39,52,54,50,40}. The real value for parameter A, which 
is used to generate the samples, is 50. Accordingly, the optimal order quantity 
Q* is equal to 55 and provides a cost equal to 9.1222. 

We consider a = 0.9. By using Algorithm [2] we compute the set of candi- 
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date order quantities Q — {50, 51, ... , 57} and the confidence interval for the 
estimated cost (cff,, c*;,) = (8.6803, 14.6220). Let us consider a heuristic strategy 
based on the maximum Ukehhood estimator. In the case of the Poisson distribu- 
tion, this estimator takes the fohowing convenient form, -p- '^f^i di, where c?j, 
for i = 1, . . . , M are the observed samples. Therefore, according to the above 
samples, the maximum likelihood estimator for A is 48.7. By using a demand 
that follows a Poisson distribution with mean rate A = 48.7 in Eq. |3]we ob- 
tain a candidate optimal order quantity Q* = 53 and an estimated expected 
cost of 9.0035. As previously, remarked, such a strategy does not provide any 
information on the reliability of the above estimates. In fact, the actual cost 
associated with this order quantity, when A — 50, is 9.3693. Conversely, our ap- 
proach reports the a confidence interval (8.9463, 11.0800) for the expected cost 
associated with Q* = 53, which in this case includes the actual cost a decision 
maker will face in case he decides to order 53 units. 



Similar issues occur for the Bayesian approach presented bv lHilll |1997l |. For 
the sample set presented above, this approach suggests to order 54 units and 
estimates a cost of 9.4764. Also this strategy does not provide any information 
on the reliability of these estimates. For an order quantity of 54 units, our 
approach reports the a confidence interval (9.0334, 10.3374) for the expected 
cost, which include the actual cost 9.1530 associated with this quantity when 
A = 50. 



5 Exponential demand 

A random demand Exp{X) is said to be distributed according to an Exponential 
law with rate parameter A > if its probability density function is 

/(A,A:) = Ae~^^ k > 0; 

the expected value of Exp{\) is 1/A. 

In the context of the Newsvendor, the exponential distribution may occur 
in two cases. An exponentially distributed random variable Exp{X) with rate 
parameter A can represent the inter-arrival time between two unit demand oc- 
currences in a Poisson process with rate parameter A. Alternatively, an expo- 
nentially distributed random variable Exp{X) can represent the total demand 
over the Newsvendor planning horizon. It is clear that the first case can be 
easily reduced to the case of a random demand that follows a Poisson distribu- 
tion with rate parameter A. Such a situation can be handled by following the 
discussion in the previous section. In the second case, by using Eq. [21 we easily 
obtain the optimal order quantity Q* for Exp{X). This is simply 
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We shall give an example. Consider a random demand Exp{l/50) with mean 
50. Let h—1 and p = 3, therefore h/ (h+p) = 0.25. The optimal order quantity 
is Q* = 69.32. Furthermore, consider the cost function 

rQ foo 
G{Q) = h (Q-i)f(X,i)di+p it-Q)f{X,i)di, 



where /(A, •) denotes the probability density function of Exp(X). Rewrite 

G{Q) = h{Q -h + {h + p) /"(I - F{\ i))dz, 
A Jq 

where F{\, •) denotes the cumulative distribution function of Exp{X). By noting 
that 

the optimal cost is G{Q*) — 69.32. 

We shall now consider, also in this case, the situation in which the parameter 
A is not known and the decision maker is given a set of M past realizations of 
the demand. As in the previous case, from these realizations he has to infer the 
range of candidate optimal order quantities and upper and lower bounds for the 
cost associated with these quantities. This range will contain the real optimum 
according to a prescribed confidence probability. 



5.1 Confidence intervals for the Exponential distribution 

We discuss the exact confidence interval that can be used to estimate the rate 
parameter A of the Exponential distribution. Consider a set of M samples di 
drawn from a random variable that is distributed according to an Exponential 
law with unknown parameter A. We rewrite d = X]f=o Since the sum of M 
i.i.d. exponentially distributed random variables with rate parameter A follows 
a Gamma distribution with shape parameter M and scale parameter 1/A, the 
a confidence interval for the unknown parameter A is (Ajt,, A„f,), where 

Xib = min{A| Pi-{Gamma{M,l/X) > J} > (1 - a)/2}, 
Xub = max{A| PT{Gamma{M,l/X) < d} > {1 ~ a)/2}. 

A closed form expression for this confidence interval - — th at employs quantiles 
from the distribution — was proposed by Trivedil 2001 . chap. 10] and takes 
the following form. Let Xn denote the chi-square distribution with n degrees of 
freedom, and G~^{Xni ') denote the inverse cumulative distribution function of 
Xn- We can write 

A;, 



G-i(xiM,a-«)/2) 



2d 
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_ G-^(xLf,(l + «)/2) 
^"^ ~ M • 

Furthermore, it is possible to express this interval using quantiles from the 
Gamma distribution. More specifically, the lower endpoint is the (1 — Q;)/2- 
quantile of a Gamma distribution with shape parameter M and scale parameter 
and the upper endpoint is the (1 + a)/2-quantile of a Gamma distribution 
with shape parameter M and scale parameter 1 /d. 



5.2 Solution method employing statistical estimation based 
on classical theory of probability 



Consider the confidence interval (Ajh, \ub) for the unknown parameter A of the 
Exponential demand. Let (5*6 be the optimal order quantity for the Newsvendor 
problem under an Exp{Xub) demand and (5*{, be the optimal order quantity for 
the Newsvendor problem under an Exp{Xib) demand. Recall that A is a rate, 
this is the reason why the optimal order quantity for the Newsvendor problem 
under an Exp{Xib) gives an upper bound Q* ^ for the real optimal order quantity. 
Clearly, the optimal order quantity Q* lies in the interval iQii,,Qub)- 



Let us write the expected total cost associated with an order quantity Q for 
a given demand rate A > 0, 



G,{Q) = h{Q-^) + {h+p) [ (l-F(A,*))di, 
it is known that this function is convex. Then, consider the function 

1 r°° 

GQ{X) = h{Q--) + ih+p) {l-F{X,i))di, (8) 

•A Jq 

in which the order quantity Q is fixed and in which we vary the demand rate 
A > 0. Unfortunately, Gq{X) is not convex in the continuous parameter A. 
Nevertheless, we prove a number of properties for this function. 

Theorem 1. lim^-^o G'q(A) — oo, lim^-^oo Gq(A) = , the function admits 
a single global minimum A*, it is strictly increasing for A > A* and strictly 
decreasing for A < A* . 



The proof is given in the Appendix IIL 



Because of the properties of Gq (A) introduced in Theorem[T]we can employ a 
simple line search procedure in order to find the A* that minimizes or maximizes 
this function over a given interval. 

Since the optimal order quantity Q* is, with confidence probability a, a value 
in the interval (Q;*^, Q^b)^ therefore easy to compute upper (c*;^) and lower 
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(c*^) bounds for the cost that a manager will face, with confidence probability 
a, whatever order quantity he chooses in this interval. 

Theorem 2. The lower hound is 
the upper bound is 

c,*fc = max{GQ.^(A,h),GQ.^(A„b)}. 
The proof is given in the Appendix III. 

Unlike the previous cases, it is not straightforward to extend the above rea- 
soning to the case in which unobserved lost sales occurred and the M observed 
past demand data, di, . . . , dM, only reflect the number of customers that pur- 
chased an item when the inventory was positive. This is due to the fact that 
the distribution of the general sum of Exponential random variables is not Ex- 
ponential, rather it is Hypoexponential. We therefore leave this discussion as a 
future research direction. 



5.3 Algorithm 

The computational procedure for Exponential demand is presented in Algorithm 
[3l The code initially computes the confidence interval {Xib, A„b) by exploiting the 
relationship between the Exponential distribution and the Gamma distribution. 
Then it computes the critical fractile /3 and the upper and lower bound for the 
set Q of candidate order quantities. Finally, it computes the upper (c*^,) and 
lower bound (q*^) f or the estimated cost associated with these candidate order 
quantities by exploiting Theorem [2] 



5.4 Example 

We consider a simple example involving the Newsvendor problem under Expo- 
nential demand. In our problem, h — 1, p = S, and the demand is a random 
variable Exp{X) for which parameter A is unknown. We are given 10 samples 
for the demand, which we may use to determine the optimal order quantity Q* , 
these are 

{39.79, 39.26, 32.21, 0.51, 107.03, 72.87, 45.23, 20.12, 26.46, 56.80}. 

The real value for parameter A, which is used to generate the samples, is 1/50. 
Accordingly, the optimal order quantity Q* is equal to 69.31 and provides a cost 
equal to 69.31. 
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Algorithm 3: Newsvendor under incomplete information. Exponential 
demand. 



input : M past demand realizations d;, i = 1, . . . , Af ; 

the holding cost: h; 

the penalty cost: p; 

the confidence probability: a. 
output: the set Q of candidate order quantities; 

the interval (c;*t,c*j) for the estimated cost. 

begin 

a ^ M; 

\ub InverseCDF [GcunmaDistributionCa, , (1 + q:)/2] ; 
A/b 4— InverseCDF [GcunmaDistributionCa, ,(1 — q:)/2] ; 
fi^p/{p + h)- 

(5*(j -s— InverseCDF [PoissonDistribution(A„b) ,/?]; 
Qlf^ 4— InverseCDF [PoissonDistributionCAjt,) ,/?]; 

Q4-{g;„...,g:j; 

<6 ^ max{GQ.^(A/f,),G'Q.^(A„6)}; 



We consider a = 0.9. The a confidence interval for the demand rate A is 
(0.0123211,0.0356664). By using Algorithm [3] we compute the range of candi- 
date order quantities Q — (38.86,112.51) and the confidence interval for the 
estimated cost {c'li„c*^i,) — (38.86, 158.81). A plot of the expected total cost as 
a function of A e (0.0123211, 0.0356664) and of Q e (38.86, 112.51) is shown in 
the Appendix IV. Let us consider a heuristic strategy based on the maximum 
likelihood estimator for the demand rate A. In the case of the Exponential 
distribution, this estimator takes the following form, A — , where di, 

l^i=l di 

for i = 1, . . . ,N are the observed samples. Therefore, according to the above 
samples, the maximum likelihood estimator for A is A = 0.0227099. By using 
a rate A — 0.0227099 in Eq. [3] we obtain a candidate optimal order quantity 
Q* = 61.04 and an estimated expected cost of 61.04. As previously, remarked, 
such a strategy does not provide any information on the reliability of the above 
estimates. In fact, the actual cost associated with this order quantity, when 
A = 1/50, is 70.03. Conversely, our approach reports the a confidence interval 
(45.71, 132.90) for the expected cost associated with Q* = 61.04, which in this 
case includes the actual cost a decision maker will face in case he decides to 
order 61.04 units. 



Similar issues occur for the Bayesian approach presented by |Hi H \l99l\ . For 



the sample set presented above, this approach suggests to order 59.14 units and 
estimates a cost of 65.05. Also this strategy does not provide any information 
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on the reliability of these estimates. For an order quantity of 59.14 units, our 
approach reports the a confidence interval (44.71, 134.63) for the expected cost, 
which includes the actual cost 70.42 associated with this quantity when A = 
1/50. 



6 Discussion and future works 



In this section we shall first discuss the drawbacks of the heuristics discussed in 
Section 12.11 and the advantages of our strategy, based on Neyman's method of 
confidence intervals, with respect to these existing approaches to the Newsven- 
dor problem under sampled demand information. Secondly, we will discuss the 
limitations of our work and possible future research lines. 



6.1 Comparison with a Bayesian and a frequentist ap- 
proach 



The Bayesian approach proposed by iHill (l997 1 presents several theoretical and 



practical drawbacks that we are now going to enumerate. Theoretical drawbacks 
of the Bayesian approach to parameter estimation are illustrated by iNevmaii 



19371 p. 343]. The first issue raised by Neyman is the fact that the unknown 



parameter a of the demand distribution is not a random variable, therefore 
assigning a prior or posterior distribution to it has no meaning. In fact, one 
may try to employ the prior probability distribution — for instance a uniform 
distribution in Hill's case — to compute Pr{a < A}, where A G (0,1). Of 
course, a is not a random variable therefore this probability should be either 1 
or depending on A. Therefore, talking about prior or posterior distribution 
for a can only represent an approximation. Although, in Bayesian probability, 
prior and posterior distributions are meant to represent a "state of knowledge" , 
that is decision maker's uncertainty about the unknown quantity a, problems 
arise immediately, as soon as one tries to interpret the meaning of the prior and 
posterior distribution in light of classical probability theory. Neyman, in fact, 
also points out that, even if the unknown parameter a is a random variable, the 
po sterior dist ribution /((i|6) for the random demand d, computed as illustrated 



in iHilll jl997| , does not generally have the property serving as a definition of 
the elementary probability law of the observed data h. In particular, this dis- 
tribution is not compatible with the classical definition of probability, in the 
sense that, if we repeat an experiment an infinite number of times, the observed 
frequency does not converge to the probability predicted by such a distribution. 
For instance, consider once more the example presented in Section Instead 
of having a single set of demand observations, we now consider M experiments, 
with M large, in each of which we observe 10 demand realizations. Let hi be 
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the demand set observed in experiment i — 1, . . . , Af. For each experiment i, 
we construct the posterior distribution, fi{di\bi), of the random demand di from 
10 de mand observations, according to the Bayesian strategy discussed by Hill 



19971 ]. Neyman points out that, in general, when we select two values Ai and 
A2, and we compute pi = Pr{Ai < di < A2}, the quantity ^/Mj^fLiPi will 
not converge — as it should, according to the law of large numbers — to its 
real value Pr{Ai < Bin{50;q) < A2}, where q is the real value of a. In our 
example, we set Ai = 26 and A2 = 28. Pr{Ai < Sm(50;0.5) < A2} = 0.1747. 
Nevertheless, if we estimate pi = Pr{Ai < di < A2 I in each e xperiment i by 
using the posterior distribution f{di\b) discussed bv IHilll |l997| for a Binomial 
demand, the estimated probability ^/MJ2i=iPi converges to 0.2654, when the 
experiment is repeated M — 100000 times. This essentially differs from the real 
value, as Neyman remarks. For this reason, using the posterior distribution in 
place of the original distribution in the problem of interest is a strategy that may 
lead to misleading results. In practice, a direct consequence is that it becomes 
hard to assess "how good" , in a probabilistic sense, the estimate produced is. It 
is fairly simple to observe this latter fact by considering once more the example 
presented in Section By using Hill's approach in HiU [199"^, we obtain the 
posterior distribution for the random demand out of the 10 samples, and then 
we use this distribution in order to compute an estimate of Q*, which in the 
particular example we consider is 29. We also employ the posterior distribution 
in the Newsvendor cost function in order to estimate the cost associated with 
the optimal order quantity selected; this turns out to be 4.6692. Unfortunately, 
in line with Neyman's observations. Hill's approach does not give any hint on 
the "quality" of the estimates produced, which in this particular case are rel- 
atively poor. In particular, we do not know with what frequency the order 
quantity will over/underestimate the real optimum order quantity, and how far 
the prescribed order quantity is likely to be from the real optimum order quan- 
tity. The same, of course, holds for the estimated optimum cost. Our approach 
based on Neyman's framework, in contrast, suggests that, with 90% confidence, 
the optimal order quantity lies between 27 and 31, and that the optimum cost 
— which we recall is 4.4946 — lies in the interval (4.4268,7.2205). Then, when 
a given heuristic suggests to order 29 units — for instance according to Hill's 
Bayesian approach — our approach also provides a 90% confidence interval es- 
timation of the cost associated with this decision, that is (4.4487, 4.9528). This 
interval actually covers, in this specific case, the real cost associated with the 
decision of ordering 29 units, i.e. 4.8904. In general, the interval will cover the 
real cost according to the prescribed confidence probability. Conversely, Hill's 
approach suggests that the cost associated with ordering 29 units is 4.6692, but 
it provides no indication on the reliability of this esti mate . Nev ertheless, we 



must underscore the fact that Hill's Bayesian approach Hilll . 1997l |. although it 



does not provide likelihood guarantees, is typically a very effective heuristic for 
order quantity selection. 

Similar issues arise in classical frequentist approaches. For the example 
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in Section 13.41 an approach based on the maximum hkehhood estimator, as 
remarked, suggests to order 29 units. The estimated cost according to this 
strategy is 4.4614. Neverth eless, also in t his case we have no indication on the 
reliability of this estimate. Kevork 201Cl| derives maximum likelihood estima- 



tors for the optimal order quantity and for the maximum expected profit. The 
asymptotic distribution for these estimators are then derived and asymptotic 
confidence intervals are extracted for the corresponding true quantities. Unfor- 
tunately, as the author remarks, these intervals are only asymptotically exact 
and do not provide the prescribed confidence for small sample sets. 

Conversely, by using our approach based on Neyman's framework, the con- 
fidence interval produced for the unknown parameter of the Binomial demand 
is always guaranteed to cover its actual value according to the prescribed confi- 
dence probability. This probability is controlled by the decision maker and in- 
fluences the size of the interval. Neyman's method differs advantageously from 
the Bayesian approach by being independent of a priori information about the 
unknown parameter. This approach remains valid even if the unknown param- 
eter is a random variable. The confidence interval for the unknown parameter 
immediately translates into a confidence interval for the order quantity and into 
a confidence interval for the actual cost. Intuitively, in the example presented in 
Section if we observe a set of 10 samples over and over again and we repeat 
our analysis, the intervals produced will cover the real optimum order quantity 
and the associate d cost acco rdin g to the prescr ibed probability. In contrast to 
the approaches of Hilll 1997 1 and Kevork 201Cll |. our approach provides explicit 



and exact likelihood guarantees that can be easily interpreted in the context 
of classical probability theory. Furthermore, by using confidence intervals, the 
decision maker has a better control on the risk of exceeding a certain cost and a 
better outlook on the range of order quantities that may be optimal according 
to the observed demands, especially when a limited set of samples is employed. 



6.2 Limitations and future works 



Our analysis is limited to three maxim um entropy probability distributions in 
the exponential family [Andersen . 197C| . each of which features a single param- 
eter that must be estimated. A maximum entropy probability distribution is a 
probability distribution whose entropy is at least as great as that of all other 
members of a specified class of dist ributions. Acc ording to Jaynes' "principle of 
maximum entropy", introduced in I JavnesI 1957 1, subject to known constraints 
(i.e. testable information), the probability distribution which best represents 
the current state of knowledge is the one with largest entropy. As shown by 
iHarremoed 2001 [. the Binomial and the Poisson are maximum entropy prob- 
ability distributions for the case in which all we know about the distribution 
of a random demand is that it has positive mean and discrete support that 
goes from to a maximum value N (Binomial) or to infinity (Poisson). The 
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Exponential distribution is the maximum entropy probability distribution for 
the case in which all we know about the distribution of a random demand is 
that it has positive mean and continuous support that goes from to infinity. 
These considerations show how broadly applicable the results in this work are. 
Remarkably, the Normal distribution — which is part of the exponential family 
and which is also a maximum entropy probability distribution — has not been 
considered. The analysis on the Normal distribution is complicated by the fact 
that two parameters, mean and variance, must be considered. Then a number 
of cases naturally arise: unknown mean and known variance, unknown variance 
and known mean, etc. For this reason, in order to keep the size and the scope 
of the discussion limited, we decided to leave this discussion as a future work. 



7 Conclusions 



Wc considered the problem of controlling the inventory of a single item with 
stochastic demand over a single period. We introduced a novel strategy to ad- 
dress the issue of demand estimation in single-period inventory optimization 
problems. Our strategy is based on the theory of statistical estimation. We em- 
ployed confidence interval analysis in order to identify a range of candidate order 
quantities that, with prescribed confidence probability, includes the real opti- 
mal order quantity for the underlying stochastic dcimand process with unknown 
parameter (s). In addition, for each candidate order quantity that is identified, 
our approach can produce an upper and a lower bound for the associated cost. 
We applied our novel approach to three demand distribution in the exponen- 
tial family: Binomial, Poisson, and Exponential. For two of these distributions 
we also discussed the case in which the decision maker faces unobserved lost 
sales. Numerical examples are presented in which we showed how our approach 
complements existing strategies based on maximum likelihood estimators or on 
Bayesian analysis. In particular, we showed that our approach does not provide 
a single order quantity recommendation and a point estimate for the associated 
cost, but — according to a prescribed confidence level — a set of candidate 
optimal order quantities and, for each of these, a confidence interval for the 
associated cost. This leaves more flexibility to the decision maker and provides 
advanced information that can be employed in order to better assess the impact 
of a given candidate optimal decision. 
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Appendix I: proofs of statements for Binomial de- 
mand 



Consider Eq. U in Section 13. 2[ it can be proved that Gq {q) is convex in the 
continuous parameter q. Firstly, we rewrite Eq. 2] as 



N 



GQ{q)=hiQ-Nq) + {p + h)J2i^-P^iBin{N-q)<i}). (9) 

i=Q 

We now show that the second derivative of this function is positive. Of course, 
this is equivalent to proving that 

j^{p + /i) ^(1 - PT{Bin{N; q) < i}) > 0. 
^ i=Q 

Theorem 3. For Q < N, 

12 ^ 

— J2{l-FT{Bin{N;q)<z}) 

is a positive function of q £ (0, 1). 

Proof. We introduce the following notation: 

f{i- N, q) = Vi{Bin{N; q) ^ i} ^ {1)q\l - g)^"', 
F{v, N, q) = Vv{Bin{N; q) < i}. 

We reduce the convexity of Gqlq) to showing that 

-3^F(z;7V,g)<0. 

Using the regularized incomplete beta function: 

F{t; N, q) = {N- i) [^^ J^' ' 1(1 - tydt- 

differentiating under the integral sign by Leibniz's rule: 

i^F{l-N,q) = -{N-z){D{l-q)^-^-^q^ 

= -iVr-i)g'(l-g)^— 1 
= -Nf{i;N-\,q) 

= -N[F{i;N-l,q)~F{i-l-N-l,q)]- 
using this recursive relationship: 

^F{i-N,q) = -7Vh(7V-l)(F(*;7V-2,g)-F(*-l;iV-2,q))+ 
{N - l){F{i -l-N-2,q)-F{i-2-N- 2, q))] 
= N{N - l)[f{i; N-2,q)- f{i-l;N~ 2, q)]; 
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and summing over i, all terms cancel out except the first and last: 

— J2 Fii-^ l) = ^(^ - N~2,q)~ f{Q~l;N~ 2, q)]. 

^ i=Q 

However, f{N; N ~2,q) — Q because it represents the probability of N successes 
in N — 2 trials, so 

— J2 ^(*; q) = -N{N ~ l)f{Q -l;N-2,q)<0. 

^ i=Q 



Appendix II: proofs of statements for Poisson de- 
mand 



Consider Eq. [5] in Section l4?2l it can be proved that Gq{X) is convex in the 
continuous parameter A. Firstly, we rewrite Eq. [5] as 

oo 

Gq(A) = h{Q - X) + {h + p)J2i^- Pr{Poisson{X) < i}). (10) 

We now show that the second derivative of this function is positive. Of course, 
this is equivalent to proving that 

J2 °° 

— 2 (/i + p) ^ (1 - Pi{Poisson{X) <i})>0. 

Therefore, we have to prove that 

— - V FT{Poisson(X] < i} > 0. 
dX^ ^ w - J - 

i=Q 

Theorem 4. ForQ>0, 

— ^ — y Pr{Poisson(X) < i} 

i=Q 

is a positive function of X > 0. 

Proof. The following derivations prove convexity for the above expression. 

ff °° 

— Pr{Poisson(A) < i} = 
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EE 



i=Q k=0 

T ooii^ , ooi \fe— 1 

i=Qfe=0 i=Qfe=l 



e"EEir--^EE 

i=Q fe=0 i=Q 



-A ' 



A'=- 



fc' ^ EEo^_x')i ^ E E _ xM ^ EE'('^_2v 

i=Qfc=l ^ ^' i=Qk=l^ '' i=Qfe=2^ ^ 



hh^^ hk(^-^ hh^^- 



°o / ' A*^ * A'^"-^ * A*^^^ 



^ (-CDF(Poisson(A), i) + 2CBF {Poisson{X), i - 1) - CDF{Poisson{X),i - 2)) 

i=Q 



-CBF{Poisson{\), Q) + 2CDF(Poisson(A), Q - 1) - CDF(Poisson(A), Q - 2) + 
-CDF(Poisson(A), Q + 1) + 2CDF(Poisson(A), Q) - CDF(Poisson(A), Q - 1) + 
-CDF(Poisson(A), Q + 2) + 2CDF(Poisson(A), Q + 1) - CDF(Poisson(A), Q) + , 



CDF(Poisson(A), Q - 1) - CDF(Poisson(A), (9-2) 

Q-l Q-2 f. 

e-^ V — - e-^ T — = 
fc! ^ A;! 

fe=0 fc=0 
\ fc=0 fc=0 / 
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Appendix III: proofs of statements for exponen- 
tial demand 



In this section we provide the proofs for the two theorems introduced in Section 
15.21 Consider the foUowing theorem, introduced in Section [ 



Theorem 1. limx^oGQ{X) = oo, hm^-j-oo Gq(A) = hQ~ , the function ad- 
mits a single global minimum X* , it is strictly increasing for A > A* and strictly 
decreasing for A < A* . 

Proof. The first fact, that is hmA_>.o Gq(A) = oo can be easily verified by simple 
algebraic derivations. We shall therefore prove that \im.\^oo Gq{X) = hQ~ and 
that the function admits a single global minimum. Let us .split Eq. into two 
parts 

Gq{X) = l^e-^'^ + (hQ ^) . (11) 



We shall consider the first term 



and the second term 



X 



e-^y, (12) 



hQ-^, (13) 



A' 

on the right hand side of Eq. separately. 

Firstly, we observe that, when X oo, Gq{X) -> hQ from below, that is 
hmA_>oo Gq{X) = hQ^ . This is due to the fact that i fl^) approaches zero faster 
than h/X does, i.e. 

h±P^-\Q 

Um = 0. 

A->oo !1 

From this fact we immediately infer that the derivative ofGQ{X) must be equal to 
zero for at least one value X other than infinity. Furthermore, the derivative of 
ilS^) is negative, strictly increasing for A > 0. The derivative of I113\) is positive 
strictly decreasing for A > 0. Therefore there exists only a single value of X 
for which the derivative of and the derivative of kl!^) add up to zero. This 
immediately implies that Gq{X) admits a single global minimum, it is strictly 
increasing for A > A* and strictly decreasing for A < A* . 



We now provide the proof for the second theorem introduced in Section 15.21 
Theorem 2. The lower bound is 

C*lb = Gq* iXub) 
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the upper bound is 

<fc = max{GQ-^(A;,,),GQ.JA„f,)}. 

Proof. Firstly, let us consider clf^. By definition, this is the expected total cost 
associated with the optimal order quantity Q;*^ for the largest possible value A„b 
that the demand rate takes in the confidence interval. Consider a demand rate A 
and the associated optimal order quantity Q\. By substituting Q in Eq. ^with 
the expression of the optimal order quantity in Eq. we immediately see that the 
expected total cost associated with an optimal order quantity is decreasing in 
the respective demand rate A — i.e. it is increasing w.r.t. the expected value 1/A 
of the demand — it immediately follows that there exists no other pair {Q\, A), 
where A G (A/{,,A„f,) that ensures a lower expected total cost. 

Let A € (A/b, A„f,) and Q G {Qib,Qub)- Consider a point in the two dimen- 
sional space X X Q, for which A = A and Q = Q. For any of such points, 
two cases can be observed, that is (i) Q > Q^, or (ii) Q < Q^. A strict 
equality can be reduced to any of these two cases. If we are in case (i), then 
^xiQ) < Cix{Qub) because of Theorem{l\ Q was already an order quantity larger 
than the optimal one, therefore Qub is also an order quantity larger than the op- 
timal one for a demand rate A. Consequently, if we increase the demand rate 
A (i.e. we decrease the expected demand l/X) our cost can only increase; this 
means that G^iQub) < G\^^{Qub)- If we are in case (ii), then G^iQ) < G^iQib) 
because of Theorem]^ Q was already an order quantity smaller than the opti- 
mal one, therefore Qib is also an order quantity smaller than the optimal one 
for a demand rate A. Consequently, if we decrease the demand rate A (i.e. we 
increase the expected demand 1/X) our cost can only increase; this means that 
G\{Qib) < G\i^{Qib). Therefore, the maximum cost, when we let X vary in 
{^ib,Kib) and Q vary in {Qib^Qub)' ^'^'^ either observed at {Q^f^^Xib) or at 

(Qub^ ^ub)- 



Appendix IV: plot for the expected total cost of 
the example in Section 15.41 

In Fig. [T]we provide a graphical outlook of the cost function discussed in Section 

EH 
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